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Abstract. Let G be a compact group acting in a real vector space V . We obtain a 
number of inequalities relating the L°° norm of a matrix element of the representa- 
tion of G with its LP norm for p < oo. We apply our results to obtain approximation 
algorithms to find the maximum absolute value of a given multivariate polynomial 
over the unit sphere (in which case G is the orthogonal group) and for the multi- 
dimensional assignment problem, a hard problem of combinatorial optimization (in 
which case G is the symmetric group). 



Introduction 

A general problem of optimization has to do with finding the maximum (min- 
imum) value of a real valued function / : X — > M. Often, the set X is endowed 
with a probability measure a and the function f possesses a certain degree of sym- 

.et.y which allow, one to compute the *-th _ / /' efficiently at lea.t 

for small values of k. Thus one may ask how well the k-th moment approximates 
the maximum value. In this paper, we describe a fairly general situation where 
some simple and meaningful relations between the maximum and moments can be 
obtained. We provide two illustrations: one, continuous, has to do with optimiza- 
tion of multivariate polynomials on the unit sphere with possible applications to 
solving systems of real polynomial equations and the other, discrete, deals with op- 
timization on the symmetric group, namely, with the multidimensional assignment 
problem, a hard problem of combinatorial optimization. 
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(1.1) The general setting. Let G be a compact group with the Haar probabihty 
measure dg acting in a finite-dimensional real vector space V. To avoid dealing 
with various technical details, we assume that the representation G — > GL{V) 
is continuous, where the general linear group GL{V) is considered in its standard 
topology. 

Let us choose a vector v E V and a linear function i : V — > M. We consider the 
orbit {gv : g E G^ oi V and the resulting function / : G — > M defined by 

f{g)=i{gv). 

In other words, / is a matrix element in the representation of G. We are interested 
in the relation between the following quantities: 
The L°° norm of /: 

ll/lloo = max 1/(^)1 = max 

g&G geG 

The L'^^ norm of / for a positive integer k: 
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As we remarked earlier, for many examples in computational mathematics, the 
quantity ||/||oo is of considerable interest and is hard to compute whereas ||/||2fc 
is relatively easy to compute for moderate values of k. First, we relate ||/||oo and 
II/II2. 

(1.2) Theorem. Let G be a compact group acting in a finite- dimensional real 
vector space V and let dg be the Haar probability measure on G. Let us fix a 
vector V and a linear function £ : V — > M and let us define a real-valued function 
f:G^Rbyf{g)^e{gv). Then 

\\fh<\\f\\^<Vdh^-\\fh. 

The bounds of Theorem 1.2 are generally sharp, see Remark 2.4. To estimate 
how well ||/||2fc approximates ||/||oo for a larger k, we invoke a general construction 
from the representation theory, see for example. Lecture 6 of [4] . 

(1.3) Tensor power. For a positive integer k, let 

k times 

be the k-th tensor power of V. There is a natural action of G in V'^^, defined on 
decomposable tensors by 



g{vi (8) ... (8) Vk) = gv\ ® . . .® gvk for g & G. 
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There is a natural action of the symmetric group Sk permuting the components in 
the tensor product. Thus, for decomposable tensors, we have 



(8) . . . (8) Vfc) = fcr-i(i) ® • • • ® fcr-i(fc) for cr & Sk- 

The action of Sk commutes with the action of G. Let Symj^{V) be the symmetric 
part of V®'^ consisting of the tensors x such that ax = x for all a E Sk- It is known 
that 

dimSym^(]/) = 



since Sym^(y) can be thought of as the space of all real homogeneous polynomials 
of degree k in dim V variables. Let 

k times 

be the k-th tensor power of v. Thus t'®^ G Symk{V) and gv'^'' G Symk{V) for 
all g E G. It turns out that how well ||/||2fe approximates ||/||oo depends on the 
dimension Dk of the subspace spanned by the orbit ^gv'^'^^. This dimension may 
be different for different v E V. Roughly, if Dk is small then v lies in a certain 
algebraic variety constructed from the action of G in F and for such v the functions 
/ are "smoother" than for those v for which Dk is large. 
Thus we obtain the following corollary of Theorem 1.2. 

(1.4) Corollary. Let G be a compact group acting in a finite-dimensional real 
vector space V and let dg be the Haar probability measure on G. Let us fix a 

vector V and a linear function £ : V — > M and let us define a real-valued function 
f : G — > M by f{g) = i{gv). For a positive integer k, let 

Dk = dimspanjsrv®'' : g e G} 

be the dimension of the span of the orbit of v®^ in V®^ . Then 

\2k <\\f Woo <{Dk)^' - Wfhk- 



Again, generally speaking, the estimates of Corollary 1.4 can not be improved, 
see Remark 2.5. A straightforward estimate of Dk < dimSym^(F) produces the 
following corollary. 

(1.5) Corollary. Let G be a compact group acting in a finite- dimensional real 
vector space V and let dg be the Haar probability measure on G. Let us fix a 
vector V and a linear function £ : V — > M and let us define a real-valued function 
f : G — > M. by f{g) = £{gv). Let k be a positive integer. Then 

/dimF + A;-l\^ „ 
2k < ll/lloo < ^ j • ||/||2fe. 



There are examples showing that the bounds of Corollary 1.5 are "almost tight". 
For instance, if G = SO{n) is the orthogonal group acting in F = M", computations 
of Section 3.3 show that the upper bound for ||/||oo is tight up to a factor of 
(uniformly on k and n). 

As we remarked earlier, in many cases we are able to compute ||/||2fe eflficiently if 
k is not very large. Quite often (see examples of Sections 3 and 4), we can compute 
||/||2fc in polynomial time for any fixed k. The following estimate shows the type 
of bound that we can achieve if we fix /c in advance. 

(1.6) Corollary. For any e > there exists a ko = ko{e) = 0(e~^) such that for 
any positive integer k > kg, for any compact group G acting in a real vector space 
V with dimV > k, for any linear function £ : V — > M, for any v E V and for the 
function f{g) = i{gv), f : G — > W, we have 

||/||2fc<||/||oo<6Vdim^^-||/||2/c. 

The paper is structured as follows. In Section 2, we prove Theorem 1.2 and 
Corollaries 1.4-1.6. In Section 3, we apply our results to the problem of finding 
the largest absolute value of a real homogeneous multivariate polynomial on the 
unit sphere, in which case G = SO{n), the orthogonal group. In particular, we 
present a simple polynomial time approximation algorithm to compute the largest 
absolute value on the sphere of a fewnomial, that is, a polynomial having only 
small (fixed) number of monomials and discuss possible applications to solving 
systems of real fewnomial equations. In Section 4, we discuss a hard problem of 
combinatorial optimization, that is, the multidimensional assignment problem, in 
which case G — Sn- In particular, our results lead to an approximation algorithm 
for finding a bijection between vertex sets of two hypergraphs Hi and H2, which 
maximizes the number edges of Hi mapped onto the edges of H2. 

We use the real model (see [3]) for computational complexity, counting the num- 
ber of arithmetic operations performed by the algorithm. Eventually, to compute 
||/||2fe from ll/lll^ we need to extract a root of degree 2k, which we count as a single 
operation. 

2. Proofs 

In this section, we prove Theorem 1.2 and Corollaries 1.4-1.6. We need some 
standard facts from the representation theory (see, for example, [4]). 

Let G be a compact group acting in a finite-dimensional real vector space V. As 
is known, V possesses a G-invariant scalar product (): 

{u, v) = {gu, gv) for all u,v &V and all g & G. 

We introduce the corresponding Euclidean norm: 



The action (representation) is called irreducible if V contains no proper G-invariant 

subspaces. As is known, if G acts in a finite-dimensional real vector space V, then 
V can be represented as a direct sum of pairwise orthogonal (with respect to a 
given G-invariant scalar product) invariant subspaces Vi such that the action of G 
in each Vi is irreducible. 

A somewhat "non-standard" feature of our construction is that we consider rep- 
resentations over the real rather than over the complex numbers. Consequently, 
we need a substitute for Schur's Lemma. It comes in the form of the following 
observation. 

Suppose that q : V — > R is a G-invariant quadratic form, that is q{gx) = q{x) 
for all X & V and all g & G. We claim that the cigcnspaces of q are G-invariant 
subspaces. A possible way to see that is to notice that the unit eigenvectors of q are 
precisely the critical points of the restriction q : § — > M where § = ja; : ||a;|| = l} 
is the unit sphere in V. 

Our first lemma is a real version of the orthogonality relations for matrix ele- 
ments. 

(2.1) Lemma. Let G be a compact group acting in a finite-dimensional real vector 
space V endowed with a G-invariant scalar product (). Suppose that the represen- 
tation of G is irreducible and let dg be the Haar probability measure on G. Then 



Proof. Let us choose a vector v eV and let us define a quadratic form q : V — > R 



Clearly, q{x) is G-invariant: q{gx) = q{x) for all a: G and all g & G. Let A be the 
largest eigenvalue of q and let W be the corresponding eigenspace. Then W is an 
invariant subspace of V and hence W — V. Thus q{x) — A||a;|p for some A > 0. 

To find A, let us compute the trace of q. On one hand, we have tiq = XdimV. 

Let qg{x) = (x, gv)'^. Then qg is a quadratic form of rank 1 with the non-zero 
eigenvalue ||(7i'|P = ||f |P which corresponds to an eigenvector x = gv. Hence tr = 
||f IP and since g(a;) is the average of g^, we have tr g = W^W^- Thus A = ||fp/dim"l/. 
Hence 



and the proof follows. 

Now we use that every representation is a sum of irreducible representations. 




by 




(2.2) Lemma. Let G be a compact group acting in a finite- dimensional real vector 
space V endowed with a G-invariant scalar product (). Let dg be the Haar probability 

5 



measure on G. Let us fix a vector v & V . Then there exists a decomposition 
V = Vi® ■ ■ - (BVk ofV into the direct sum of non-zero pairwise orthogonal invariant 
subspaces such that for every x & V we have 



/ {x, gv)^ dg = y] 
Jg ~1 



^ " "2 lU, 112 



dim Vi 

where Xi and Vi are the orthogonal projections onto Vi of x and v respectively. 
Proof. Let us define a quadratic form q : V — > M by 

q{x) = / (x, gv)"^ dg. 



JG 

Tlien q is G-invariant, q{gx) — q{x) for all G G and all x E V . Thus the 
eigenspaces of q are G-invariant subspaces of V. Let us write every eigenspace as a 
direct sum of pairwise orthogonal invariant subspaces Vi such that the action of G 
in each Vi is irreducible. Thus we obtain the decomposition V = Vi® . . .®Vk and 
we have 

k 

where Xi is the orthogonal projection of x onto Vi and are non-negative numbers. 
To find Ai, let us choose a non-zero x & Vi. Then {x,gv) = {x,gvi) and by Lemma 
2.1, we get 

II l|2 II ||2 

/ \ \ II ii2 r^i ■ P 

q{x) = Xi\\x\\ = — - — - — , 
dim y/j 

from which 

Wv-P 
dimVi 

The proof now follows. □ 

(2.3) Remark. A decomposition F = Vi © . . . © of a representation into the 
direct sum of pairwise orthogonal irreducible components is not unique as long as 
some irreducible representation appear with a multiplicity greater than 1 (which 
means that the representations of G in some subspaces Vi are isomorphic). One 
can construct some simple examples showing that the decomposition of Lemma 2.2 
indeed depends on v. 

Now we are ready to prove Theorem 1.2. 

Proof of Theorem 1.2. The inequality 

2 < 
6 



is quite standard. Let us prove that 



loo < \/ dim V ■ \\j ||2. 
Let e be the identity in G. We note that it suffices to prove that 

|/(e)| = K(^)|< Vdim^^-||/||2, 
because the inequahty for f{g) = i{gv) would follow then by choosing a new vector 



v: 



new V := old v). 



Let us introduce a G-invariant scalar product () in V so that £{x) — (c, x) for 
some c E V and all x E V. Applying Lemma 2.2, we obtain a decomposition 
V = Vi Q) ■ ■ ■ (B Vk into the direct sum of pairwise orthogonal invariant subspaces 
such that 

|2 lU, ||2 



= / (c, gvf dg = y] 



\Ci 



1=1 



where q and f j are the orthogonal projections onto Vi of c and v respectively. We 
have 



/(e) = {c,v) = ^{ci.Vi 



and hence 



Let 



|/(e)|< J]|(c„.;,)|< J]||c, 



1=1 i=i 



Oii = 



VdimT^ 



Then 



^ k \ 2 k k 

|/(e)|' < Ij^ai^/dimVi) < af) (j]dimF,) = (dimF) 

^i=l ^ i=l i=l 



and the proof follows. □ 

(2.4) Remark. Analyzing the proof of Theorem 1.2, it is not hard to find out when 
the bound ||/||oo < VdimF • ||/||2 is sharp. In particular, the bound is sharp for 
the class of linear functions on the orbit of v as long as the orbit of v spans V. Here 
are some natural cases when the bound is attained. 

Suppose, for example, that we have an absolutely irreducible representation p 
of G in a real vector space W (that is, the representation remains irreducible after 
complexification). Thus, for every g E G, p{g) is an operator in W. We interpret 
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p{g) as a point in the space V = End{W) of all linear transformations W — > W. 
Let xid) = tr(5i) be the character of the representation. We think of xid) ^-s 
of a linear function on the orbit of the identity operator / G End(VF) under the 
action g{x) — p{g)x for all x G End(VF). Then ||x||oo = dimVF = V dim V. The 
orthogonality relations for the characters (see, for example, Chapter 2 of [4]) state 
that 11x11 2 = 1 and hence the bound of Theorem 1.2 holds with equality. 

As another example, let us consider a finite group G of cardinality \G\ and an 
arbitrary function / : G — > M. Of course, in this case, the inequality ||/||oo < 
^/\G\ ■ II /II 2 is the best we can hope for (take / to be the delta-function of an 
element of G). The function / can be thought of as a linear function on the orbit 
of a point in the regular representation of G. The space V in this case is the vector 
space of all linear functions / : G — > M where G acts by shifts: gf{x) = f{g~^x). 
Let V & V he the delta- function at the identity: v{e) = 1 where e is the identity 
in G and v{g) — for all g e. Then / is a linear function on the orbit of v and 
dimV= \G\. 

To prove Corollary 1.4, we use the construction of the tensor power (see Section 
1.3). 

Proof of Corollary 1.4- Let us define a function h : G — > M by 

Thus h is a linear function on the orbit of v'^^. Let 

W = span{gv®^ ■ 9 ^ G} 

be the span of the orbit of v"^^. Hence dim 14^ = Dk- Applying Theorem 1.2 to the 
linear function h on the orbit of v®'^ in W, we get 

||/i||2<||/i||oo</D;-||/i||2. 

Now we note that \\h\\oo = \\f\L and that \\h\\2 = \\f\\^,^. □ 

1 

(2.5) Remark. The bound ||/||oo < (-Dfc) ||/||2fc is rarely sharp. One example 
when it is sharp is provided by a generic orbit in the regular representation of a 
finite group G, see Remark 2.4. In Section 3.3, we present a series of examples of 
matrix functions for G = SO{n) for which the estimate is sharp up to a constant 
factor uniformly on k (and uniformly on n). 

Corollary 1.5 follows by a general estimate of D^. 

Proof of Corollary 1.5. We apply Corollary 1.4. The orbit \^gv®^^ lies in the 
symmetric part ^yrcL^iV) of the tensor product V®^ and hence 

Dfe = dimspanj^rv®'^} < dimSym^(V^) = 
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/dim V + k-l 



□ 



(2.6) Remark. As follows from Section 3.3, the upper bound, for ||/||c)o 

is sharp up 

to a constant factor uniformly on A; if G = SO{n), F = R"' and G acts in V by its 
defining representation. 

We describe below classes of functions / : G — > M for which some sharp esti- 
mates can be obtained. Let us fix a representation p of G in a real vector space 
V and let J^p be the vector space spanned by the matrix elements of p. Then for 
some constant C{p,k) and for all f e we have ||/||oo < ^(p, A;) ||/||2fc. Let us 
assume that p is absolutely irreducible (cf. Remark 2.4). In principle, the best 
possible value of C(p, k) can be computed from the representation theory of G as 
follows. Let us choose an / G jFp. Shifting /, if necessary, we may assume that 
the maximum absolute value of / is attained at the identity e of G. Let us define 
h:G — >Rhj 

h{x) = I f{g~^xg) dg for all x e G. 

Jg 

Then ||/i||oo = ||/||oo and \\h\\2k < II/II2A: for all positive integers k. Thus the largest 
ratio ||/||oo/||/||2fc ior / e jFp is attained when / satisfies f{g~^xg) — f{x) for all 
g E G and all x E G from which it follows that / is a multiple of the character 
x(^) = trp(^), see Remark 2.4. We observe that ||x||oo = dimF. Moreover, 
the orthogonality relations (see Lecture 6 of [4]) imply that is the sum of 

squares of multiplicities of the irreducible components of the tensor power p®^. 
Summarizing, we conclude that in order to be able to compute the best possible 
constant C{p,k) such that ||/||oo < C{p,k)\\f\\2k for any linear combination / of 
matrix elements of p, it suffices to know how the tensor power p®^ decomposes into 
the sum of absolutely irreducible representations. 

Finally, Corollary 1.6 follows by an estimate of the binomial coefficient. 

Proof of Corollary 1.6. Let us choose a /cq such that (fc!)^/^ > 2e~^ for all k > kg. 
By Stirling's formula, we can choose ko = 0(e~^). Then 

/dim V + k-l\^ /dim V ■ (dim V + !)••• (dim V + k - 



\ k J \ k\ 



< eVdimF. 



The proof follows by Corollary 1.5. □ 

3. Applications to Polynomials 

In this section, we apply our results to approximate the maximum absolute value 
of a homogeneous multivariate polynomial on the unit sphere. 
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Let p be a homogeneous polynomial of degree d in n real variables ^i, . . . ,Cn- 
Thus we can write 

P{x) ^ Yl lii-idUi---^id for a; = (^1,... ,Cn) 
l<ii,... ,id<n 

where are some real numbers. 

Let M"" be the n-dimensional Euclidean space and let x = (^i, . . . ,^^) G be 

a point. Then V = iW") can be identified with the space . The coordinates 
of a typical point (tensor) X & V are 

(Xi^,,,i^ : l<ii,... ,id<'nj 

and the scalar product in V is defined by 

l<ii,... ,id<n 

For X = {^i, . . . ,^n) & the coordinates of x^'^ are 

{^ii---^id foi" ^<'i'i,--- ,id<nj- 
Therefore, we can write 

p{x) = (c, a;®'') where c= (^7ii...id)- 

Let G = <S'0(n) be the group of orientation preserving orthogonal transformations 
of M"^. Then G acts in V by the d-th tensor power of its defining representation in 
M"^. Let us choose = (1, 0, . . . , 0) e M"^. Then, for any e G, we have 

and the orbit {gw : e G} is the unit sphere S"^"^ c W^. Thus the values of p{x), 
as X ranges over the unit sphere in M", are the values of the linear function 

£{gw®'^) = (c, gw^'^) = (c, x^'^) 

as ^ ranges over the orthogonal group SO{n). 

Moreover, the push-forward of the Haar probability measure dg on G is the 
probability measure dx on S"~^. Thus we connect the values of a polynomial 
on the unit sphere with the values of a linear function on the orbit of the group 
G = SO{n). 
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(3.1) Corollary. Letp be a homogeneous polynomial of degree d inn real variables, 

let S"^"^ be the unit sphere in and let dx be the rotation invariant probability 
measure on S"^"^. For a positive integer k, let us define the L^^ norm of p by 



\2k=[ I P^H^) dx 

and the L°° norm by 



Then 



oo = max \p{x)\. 



, kd + n 1\ 

2k < IIpIIoo < ( I \\p\\2k- 



Proof. We apply Corollary 1.5. Let w = (1,0, ... ,0) be as above. Then, for 
V = w'^^, we can write p{gw) = £{gv) for some linear functional £ : V — IR and 
all g E G. The dimension D]^ of the span of the orbit \^gv®^ = gw^'"^ : g G G} is 
that of the space of homogeneous polynomials of degree kd in n variables. Hence 



Dk 

We have 



kd + n — 1 
kd 



[ f^{gv) dg^ [ p^^{x) dx. 

JG Js"-i 



The proof now follows. □ 

One way to integrate polynomials over the unite sphere is to take the sum of 
the integrals of the monomials. The following result is certainly known, but for the 
sake of completeness, we sketch its proof here. 

(3.2) Lemma. Let p{x) = ■ ■ ■ be a monomial. If at least one of ai 's is odd 

then 

/ p{x) dx = 0. 

// ai — 2(3i, where (3i are non-negative integers for i — 1, . . . ,n, then 

,(,),,^ r(n/2)nt,r(ft + i/2) 

7r"/2r(ft + ... + /J„ + r./2)' 

where dx is the Haar probability measure on . 
Sketch of Proof. If ai is odd then 
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and hence the average value of p over the unit sphere is 0. 
Assuming that ctj = 2/?j for z = 1, . . . , n, we get 



„ Up n 

/ p{x)e-\\-\\' e^^e-^' = l[r{l3, + 1/2), 

where ^ is the standard Lebesgue measure in M". 

On the other hand, passing to the polar coordinates and using that p is homo- 
geneous of degree d = 2(/3i, + . . . + (3n), we get 



/ p(a;)e-ll^ll' d/x - ■ ( / p{x) dx) ■ ( [ r'^+n-i^-r^ 

where jS"'"^] = /T{n/2) is the Euclidean volume of the unit sphere in W^. 

The proof now follows. □ 

The estimates of Corollary 3.1 are probably not optimal (apart from the case of 
/c = 1), but the following simple example shows that in some sense, they are close 
to being optimal. 

(3.3) Powers of linear functions. Let p be the power of a linear function, for 
example, p{x) = ^f. Then ||p||oo = 1 and, by Lemma 3.2, 



2k — 



T{n/2)T{kd+ 1/2) 
■^T{kd + n/2) 

Then Corollary 3.1 gives us the estimate 



/ T{n/2)T{kd + l/2)T{kd + n) \ ^ ^ f V{n/2)T{kd + n) \^ 

\ ^V{kd + n/2)V{n)V{kd + 1) J " \ V{kd + n/2)V{n) _ 

n{n+l)---{kd + n-l) 
(n/2)(n/2 +l)---{kd + n/2 - 1) 



< 22 



Hence, among all homogeneous polynomials of a given degree d, powers of linear 
functions give the largest ratio ||/||oo/||/||2A; up to a constant factor depending on 
the degree of / and independent of the number of variables n and the value of k. 
This may serve as an indication that the bound of Corollary 1.5 are not too bad, 
cf. Remarks 2.5 and 2.6. G. Blekherman [2] pointed out to the author that the 
powers, in general, do not provide exactly the largest ratio ||/||oo/||/||2fe among all 
polynomials of a given degree d. Such "extremal" polynomials / were computed 
by G. Blekherman when some of the parameters n, d and k are small. 

Suppose we want to approximate ||p||oo by \\p\\2k for a sufficiently large k. Let 
us see what trade-off between between the computational complexity and accuracy 
can we achieve. 
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(3.4) Low degree polynomials. Let us fix the degree d and allow the number of 

variables to vary. Suppose that we are given a homogeneous polynomial p of degree 
d and that we want to estimate ||p||oo- This problem is provably computationally 
hard already ior d — 4 (one can infer it from results of Part 1 of [3] ) and is suspected 
to be hard for d = 3. 

Let m be the number of monomials in so m = 0(n'^). We observe that for 
any fixed /c, the direct computation of p^^{x) and computing ||p||2fc via Lemma 3.2 
has 0{rn^'^) complexity. One the other hand, using Corollary 3.1, we get that 

\\p\\oo<C{k)n''/^\\p\\2k where C{k) = 0{k-^/^). 

In other words, for any fixed e > there is a polynomial time algorithm estimating 
Hp I loo within a factor of en^l"^ . If we want a better estimate, we have to take a 
larger k. Thus, for any constant C > 1, from Corollary 3.1 (cf. also Corollary 1.6), 
we get that 

3 < C'Iblbfc for some k = 0{n). 



Since p^'^{x) contains at most (^'^'^2^^"'^) monomials, we can compute \\p\\2k by 
Lemma 3.2 in 2^^'^^ time. Summarizing, for any C > 1 there exists a 7 > such 
that we can approximate ||p||oo within a factor C in 2"''^ time. 

(3.5) Fewnomials. Suppose that we do not fix the degree d of p but fix instead 
the number m of monomials in p. Thus we can write 

i=l 

where 

are monomials. For a positive integer k, by the multinomial expansion, we get 

mi 



ri,... ,r„i>0 
ri + ...+rm=2k 



Thus p'^^ contains at most (J^^^i ^) monomials, which is a polynomial in k when 
m is fixed. Using Lemma 3.2, we compute ||p||2A; in 0((in(2/c)"*) time. Given an 
e > 0, let us choose an integer k = 0{e~^n'^ Ind) such that 

n — I 

——\ii{kd+l) <ln(l + e). 

Using Corollary 3.1 and a simple estimate 

kd + n-l\ _ {kd+l){kd + 2)---{kd + n-l) ^ ^^^^-^^n-i 



n-1 J l-2---(n-l) 
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we conclude that 

|b||2fc<|b||oo<(l + 6)-|b||2fe. 

Hence as long as the number of monomials is fixed, we get a polynomial time ap- 
proximation algorithm, which, for any given e > computes the maximum absolute 
value of a given polynomial ( "fewnomial" ) over the unit sphere within a relative 
error of e, in time polynomial in e~^, the number of variables n and the degree 
d of the polynomial. In fact, the only place where we have to use polynomially 
many in d arithmetic operations is when we compute gamma-functions (factorials) 
in Lemma 3.2. Apart from that, the running time of the algorithm is polynomial 
in Ind. 

Computing or approximating the maximum absolute value of a polynomial on 
the unit sphere can be used for testing whether a given system of real polynomial 
equations has a real solution, a difficult and important problem, sec for example, 
[3] and [7]. Suppose that pf. i = 1, . . . , s are given homogeneous polynomials of 
degree din n variables x = (^i, . . . ,Cn) ctnd that we would like to test whether the 
system 

Pi{x) = for i = 1, . . . , s 
has a real solution x ^ 0. Let 

s 

Thus we want to test whether 

min q{x) = 0. 

Let us choose a 

7 > max q{x) 

and let 

p = 7||a;||^'^ — q. 
Thus the problem reduces to checking whether 

max \p{x) I = 7. 

If the polynomials pi of the original system did not have too many monomials, 
we can try to approximate |b||oo by ||p||2fc for a reasonably large k, cf. Section 
3.5. Similarly, to choose an appropriate 7, we can compute \\q\\2k for ^ sufficiently 
large k. The number of monomials in the system is relevant to the "topological 
complexity" of the set of real solutions [5], so it should not be surprising that 
it is also relevant to the computational complexity of the decision problem. In 
particular, this approach may be useful for detecting "badly unsolvable" systems 
(systems for which the value of |b||oo is substantially smaller than 7) of fewnomial 
equations. 
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4. Applications to Combinatorial Optimization 

Let us fix a number d and let V = = R""* be the vector space of d- 

dimensional arrays (tensors) 



X 



To simplify the notation somewhat, we denote the coordinates of X by xj, where 
/= (ii,... ,id)- 

We introduce the scalar product by 

(X, y ) = ^ x/y/ for / = (1 < H, . . . ,Zd < n). 



Let G = Sn he the symmetric group of all permutations g of the set {!,... ,n}. 
Wc introduce the action of Sn on V by the d-th tensor power of the natural action 
of Sn in M": 

Y = gX provided xi^ygi where g{ii, . . . ,id) ^ {g{ii), ■ ■ ■ ,g{id))- 
Let us choose two tensors A,BeV and let 

fig) = (B, gA) = ai^...ubg^n)...g{ia), f ■ Sn — >^ 

l<ii,... ,id<n 

be the corresponding matrix element. 

The problem of maximizing (minimizing) / is one of the most general problems of 
combinatorial optimization, known as the d-dimensional assignment problem (see, 
for example, [6]). It is straightforward for d = 1 but already quite difficult ioi d — 2 
(see [1]). 

(4.1) Example: hypergraphs. Recall that a d-hypergraph H on the set 
{!,... , n} is a set of subsets E C {1, . . . , n}, called edges of H, such that \E\ < d for 
the cardinality of every edge E of H. A hypergraph is called uniform provided 
\E\ = d for every edge E of H. Let Hi and H2 be uniform (i-hypergraphs with the 
set of vertices {!,... , n}. Let us define the adjacency tensor A = {ai^...id) of Hi by 



1 if {zi, . . . , id} is an edge of Hi 
otherwise. 



Let us define B = (6ii...i^) by: 



ii...td 



_ J d! 



if {ii,... , id} is an edge of H2 



otherwise. 
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A permutation g of the set {1,... ,n} is interpreted as a bijection between the 
vertices of H2 and the vertices of Hi and the value of 

f{g) = (S, gA) 

is the number of edges of H2 mapped onto the edges of H\. The value of ||/||oo is 
the maximum number of edges of Hi and H2 that can be matched by a bijection 
of the vertices of Hi and H2- If Hi and H2 are not uniform, we can modify B by 
letting 

_ ki\---kr\ 

provided {ii, . . . ,2^} is an edge of H2 and the multiplicities of the elements in the 
multiset {{«i, • • • , id}} are fci, . . . , fc^., so that /ci + . . . + /c^ = (i. Then again the 
value of II /II 00 is equal to the maximum number of edges of Hi and H2 that can be 
matched by a bijection of the vertex sets. 

One can extend this construction to oriented hypergraphs whose edges are or- 
dered subsets of {1, . . . , n}. By introducing weights on the edges of Hi and H2 we 
can introduce "prices" for matching (or mismatching) particular edges. 

Applying Corollary 1.5, we get the inequality 

1 

(„d 1 L _ 1 \ 2fc 
k ) 11-^11^'^ 

for the function / of a general d-dimensional assignment problem. 

In various special cases, the bound can be somewhat improved by using Corollary 
1.4. For example, if the coordinates of A (or B) are O's and I's, one can prove that 

k 

||/||2fc<||/||oo<i^^(n,d,A;)-||/||2fc where D{n,d,k) = Y, 

i=i 

We claim that for small (fixed) values of k the value of ||/||2fe can be computed 
relatively easily (in polynomial time). First, we observe that computation of ||/||2fc 
reduces to computation of the average of a matrix element for larger tensors. 

(4.2) Lemma. Let us fix two tensors A = (a/) and B = (bi) for 

/ = (1 < ii, . . . , id < n) . For a positive integer m (in particular, for m = 2k), let 

us define tensors X — A®"^ and Y = B®'^ as follows: 

X = {xj) and Y = (yj) where J = (l < ji, . . . , jdm < n) 

and where 

xj = ai-,--- ai^ and yj = bi^ ■ ■ ■ bi^ provided J = (h, . . . , Im)- 
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Then 

1 



Proof. The proof follows by observation that 

{B, gAY = (S®-, gA^^) = {Y, gX). 

□ 

Next, we show how to compute the average. 

(4.3) Lemma. Let us fix a positive integer I (in particular, I = md = 2kd). For a 
partition E = |Si, ... , j of the set {1, . . . , /} into non-empty disjoint subsets, we 
say that a sequence I = (ii, . . . has type S if for each Sp the indices ij : j & T,p 
are all equal and if for each pair of subsets Sp and E^ the indices ij : j & Hp and 
ij : j E Tiq are different. 

Let X = (xi) and Y = (yi), I = {l < ii, ■ ■ ■ , ii < n) be tensors (in particular, 
we can have X = A®"^ = A®^'^ and Y = S®'" = B®'^''). 

Let us define the tensors X — (xj) and Y = (yj), I = {l < ii, ■ ■ ■ , ii < n) by 

xi = ; — xj provided type I — (Ei, . . . , E^.) 

nl ^-^ 

J: type of J= type of / 



and 



j — - z2 provided type I = (Ei, . . . , E^). 



Vi , 

nl 

J: type J= type / 



Then 

Proof. The two index sets / = (u, . . . ,ii) and J = (ii,... belong to the 
same orbit of the action / i — > gl of Sn if and only if they have the same type 
|Ei, . . . , Ej. j. Moreover, the stabilizer of / consists of (n — r)! permutations. Hence 

X=lv^x and y = lv^r. 

n! ^-^ n\ ^-^ 

g^Sn g^Sn 



We have 



g^Sn g&Sn g&Sn 



and the proof follows. □ 
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Combining Lemmas 4.2 and 4.3, we observe that as long as d and k are fixed, 

we can compute II/II2A; in 0(n'^^'^^ time, that is, in polynomial in n time. 

In particular, from Corollary 1.6, we conclude that for any fixed d and for any 
fixed e > there exists a polynomial in n algorithm for estimating ||/||oo within a 
factor of en'^l'^ . This result seems to be new already for d = 2, cf. [1]. 

Remark. So far we have shown how to approximate ||/||oo by ||/||2A; but we 
did not discuss how to find a particular permutation g which gives the value of 
\f{g)\ close to ||/||oo- In fact, it is not hard to construct a permutation g & Sn for 
which \ f{g) \ > ll/lhfc and hence approximates ||/||oo within a factor of en'^/^ 

at the cost of some extra work, which still results in a polynomial time algorithm 
when k is fixed. The idea is to use the "divide-and-conquer" approach. We split 
the symmetric group Sn into the union of cosets Sj — [g : g{l) = j} and then 
compute the average value of /^'^ over each coset separately (this would require 
some straightforward modification of Lemma 4.3). Then a coset should be chosen 
which gives the largest average. Thus we have determined g{l) = j and we proceed 
to determine g{2), . . . , g{n) successively. 
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